Can Gold " Cope " with Wals? Retrofitting an Ontology onto the World Atlas of Language Structures
نویسندگان
چکیده
0. Introduction The World Atlas of Language Structures (WALS, Haspelmath et al. 2005) is a large-scale “database of databases” consisting of 141 typological databases, covering a wide range of grammatical features, joined into one composite resource through the use of a common metadata scheme. While this metadata scheme ensures interoperability among databases across some dimensions (e.g., language names and families), it falls far short of allowing complete database interoperability. At present, a project is underway to “retrofit” an ontology onto this existing resource. Two broad questions being addressed by the project are: (i) What conceptual and design problems need to be solved in order to build an ontology “internal” to WALS which can allow for a high degree of interoperability among the WALS databases? and (ii) How can the WALS categories be related to a general ontology? Or, to put it another way, we are interested in determining (i) how we can build a worthwhile Community of Practice Extension (or COPE) for WALS and (ii) how the categories in this COPE can be related to categories in the General Ontology for Linguistic Description (henceforth, GOLD ontology; Farrar and Langendoen (2003)). This paper is structured as follows. Section 1 gives some background information on WALS, including discussion of its history and its overall design. Section 2 discusses the basic methodology adopted by the WALS ontology project. This section also discusses some of the research results of the WALS ontology project which we believe may be relevant to the conceptual relationship between a typological COPE and the GOLD ontology. Section 3 discusses some ways in which we believe GOLD could be extended to better relate to typological concepts and some design desiderata for ontological tools which would facilitate the creation of an ontology like the WALS ontology. Finally, section 4 summarizes the current findings of this ongoing project.
منابع مشابه
Investigating Efficiency of Shotcrete for Retrofitting Masonry Buildings
One of a feasible and efficient method to retrofit structures is spraying shotcrete which is widely applied around the world. Shotcrete is concrete with fine aggregates which are sprayed through a hose and by air pressure coat at high velocity onto a surface. In the current research, three masonry schools from different regions of Iran are selected. The retrofitted wall surfaces have been prepa...
متن کاملWALS in the university classroom: A review
The world atlas of language structures (WALS) originally appealed to the linguistics community as a resource for research. However, the relevance of the feature chapters to teaching environments and the user-friendly nature of the Interactive Reference Tool also make it suitable for university classrooms. Based on our experiences using WALS in two typology courses at the University of Mancheste...
متن کاملFrom Phonology to Syntax: Unsupervised Linguistic Typology at Different Levels with Language Embeddings
A core part of linguistic typology is the classification of languages according to linguistic properties, such as those detailed in the World Atlas of Language Structure (WALS). Doing this manually is prohibitively time-consuming, which is in part evidenced by the fact that only 100 out of over 7,000 languages spoken in the world are fully covered in WALS. We learn distributed language represen...
متن کاملHow Good are Typological Distances for Determining Genealogical Relationships among Languages?
The recent availability of typological databases such as World Atlas of Language Structures (WALS) has spurred investigations regarding their utility for language classification, the stability of typological features in genetic linguistics and typological universals across the language families of the world. Existing work on building NLP resources such as parallel corpora, treebanks for under-r...
متن کاملClassifying Syntactic Regularities for Hundreds of Languages
This paper presents a comparison of classification methods for linguistic typology for the purpose of expanding an extensive, but sparse language resource: the World Atlas of Language Structures (WALS) (Dryer and Haspelmath, 2013). We experimented with a variety of regression and nearest-neighbor methods for use in classification over a set of 325 languages and six syntactic rules drawn from WA...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005